arXiv:1508.07267v2 [math.PR] 12 Sep 2015 


Functional van den Berg-Kesten-Reimer Inequalities and 
their Duals, with Applications 

Larry Goldstein and Yosef Rinott* * 

University of Southern California, Hebrew University of Jerusalem 

September 15, 2015 


Abbreviated Title: Functional BKR Inequalities 
Abstract 

The BKR inequality conjectured by van den Berg and Kesten in [llj . and proved 
by Reimer in [8], states that for A and B events on S, a finite product of finite sets 
Si,i = 1,... ,n, and P any product measure on S, 

P(AaB) < F(A)P(B), 

where the set ADR consists of the elementary events which lie in both A and B for 
‘disjoint reasons.’ Precisely, with n := n} and K <Z n, for x S 5 letting 

[x]i^ = {y £ S : yi = Xi,i € K}, the set ADR consists of all x € 5 for which there 
exist disjoint subsets K and L of n for which [x];^ C A and [x]/, C R. 

The BKR inequality is extended to the following functional version on a general 
finite product measure space {S, §) with product probability measure P, 


E { max / (X)ff (X) V < E {/(X)} E U(X)} , 

{KCn,LCn ) 

where / and g are non-negative measurable functions, /^(x) = ess mfy^j^j^^/(y) 
and 9^(x) = essinfygjxji, The original BKR inequality is recovered by taking 

/(x) = 1 a(x) and g(x) = 1_b(x), and applying the fact that in general Iaub < 

max;^nL=0 (x) • 

Related formulations, and functional versions of the dual inequality on events by 
Kahn, Saks, and Smyth [6], are also considered. Applications include order statistics, 
assignment problems, and paths in random graphs. 
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1 Introduction 


This paper is a minor revision of |S], correcting an error in equation (E]) that was pointed out 
to us by Richard Arratia during the preparation of [2], a draft of which he shared with us. 
Inequality (|5]) was earlier incorrectly stated as an equality. While the correction is important 
on its own, the error is inconsequential for the purposes of our original work. As ([5]) was 
only applied to show fl20|) , for which we now provide a much simpler argument not involving 
([5]), no results depended on its validity. In particular, the statements of all theorems here 
are the same as in |5]. 

For X = (xi,..., Xn) e S, where S = IliLi product space, and K = {ki ,..., km} C 

n := {1, ... , n} with ki < ■ ■ ■ < kmj dehne 

xx = {xki, ..., Xk^) and [x]x = {y e S : yx = xr}, 

the restriction of x to the indicated coordinates, and the collection of all elements in S which 
agree with x in those coordinates, respectively. For A, R C S' we say that x G A, y G R 
disjointly if there exist 

A", L C n, K n L = 0 such that [x]j^ C A and [y]i C B, (1) 

and denote 

AHB = {x : X G A, X G A disjointly}. (2) 

The operation ADR corresponds to elementary events which are in both A and B for disjoint 
‘reasons’ in the sense that inclusion in A and B is determined on disjoint sets of coordinates. 

Theorem 11.11 was conjectured in van den Berg and Kesten [TT]. It was proved in [TT] for A 
and B increasing sets and S = {0,1}"’, and it was also demonstrated there that Theorem ll.il 
follows from its special case S = {0,1}”. Using the latter fact, the conjecture was established 
in general by Reimer |8]. 

Theorem 1.1. For P = nr=i product measure on S = nr=i finite, 

P{AnB) < P{A)P{B). (3) 

Many useful formulations can be found in van den Berg and Fiebig [10], in addition to 
the following motivating example which appeared earlier in [TT]. Independently assign a 
random direction to each edge e = {vi,Vj} of a hnite graph, with pfivi,Vj) = 1 — pfivj^Vi) 
the probability of the edge e being directed from vertex Vi to Vj. With Ui, 14, TFi, W 2 sets of 
vertices. Theorem 11.11 yields that the product of the probabilities that there exist directed 
paths from VT to 14 (event A) and from Wi to 144 (event B) is an upper bound to the 
probability that there exist two disjoint directed paths, one from I 4 to I 4 and another from 
IFi to 144 (event ADR). 

The main thrust of this paper is to show how Theorem 11.11 implies inequalities in terms of 
functions, of which (|3]) is the special case of indicators, and similarly for the dual inequality 
of [6]. These functional inequalities, and their duals, are stated in Theorems II. 21 and II.51 and 
their proofs can be found in Section |3l Applications to order statistics, allocation problems, 
and random graphs are given in Section [2l Specializing to monotone functions, we derive 
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related inequalities and stochastic orderings in Section 01 these latter results are connected 
to those of Alexander [1]. 

For each ^ = 1,..., n, let {Si, Si) be measurable spaces, and set S = nr=i ^ ~ 

0"=! product sigma algebra. Henceforth, all given real valued functions on S, such 

as fa, 90 , a G A, (3 G B are assumed to be (§, B) measurable where B denotes the Borel 
sigma algebra of R, and functions on S with values in 2“, such as /F(x) in inequality (d) 
of Theorem 11.21 below, are assumed to be (§, 2^") measurable. Measurability issues arise 
in definitions ([9]), (j4]), and ffT6|) . and are settled in Section [5l We also show in Section [5] 
that Theorem 11.21 applies to the completion of the measure space (S', S) with respect to the 
measure P appearing in the theorem; similarly for Theorem 11.51 

For K O n we say that a function / defined on S depends on K if Xi^ = yx implies 
/(x) = /(y). The inequalities in Theorems 11.21 and 11.51 require one of two frameworks, the 
hrst of which is the following. 

Framework 1. {fa{'^)}aeA {g 0 {y)} 0^13 are given collections of non-negative functions 

on S, such that fa, 90 depend respectively on subsets of n Ka,L 0 in K, = {Ka}a&A 
C = where A and B are finite or countable. 

The elements of K. and C are not assumed to be distinct; we may have, say, Ka = 
for some ol ^ ^ and /„ 7 ^ f^. Note also that if a function depends on K, it depends on any 
subset of n containing K. For notational brevity we may write a for Ka] for example, we 
may use a fl /? as an abbreviation for Ka H Ly, and also Xq, for xx^. 

The second framework is 

Framework 2. / and g are two given non-negative functions, and K and C are any subsets 
ofT^. With P a probability measure on (S', §) define for iF G /C, L G £, 

/ (x)=ess inf /(y), and ^ (x) = ess inf ^(y), (4) 

yG[x]K yG[x]i 

where the essential infimums for /^(x) and g^i'x.) are being taken with respect to the product 
probability measure on the coordinates in K^ and K respectively. 

Our functional extension of the BKR inequality ([3]) is 

Theorem 1.2. Let X = (Xi,..., X„) E S be a random vector and P a probability measure 
on {S, S) under which Xi,..., X„ are independent. 

1. Under frameworkUl 


Ei sup fa{^)g 0 {X) }<E 

on^=0 


sup/„(X) E 


supgyiX) 

h 


2. Under framework\^ 


E 


max 4(X)^7jX) V < F{/(X)} F{^(X)}. 

KnL=<I) —^ I 

KeJC^Lec I 


(a) 


{b) 
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The special case of (b) where /C = £ = 2", the collection of all subsets of n, clearly 
implies the inequality in general. 

In 0. inequality ([3]) for the □ operation was proven only for discrete hnite product 
spaces, that is, a hnite product of hnite sets; Theorem 11.21 applies to functions of a vector X 
having independent coordinates taking values in any measure space. For /(x) = ly 4 (x) and 
5f(x) = 1b(x) for A, B E E), we have 

max^4(x)^Jx). (5) 

To see note that replacing essential inhmum by inhmum in (jl]), the inequality becomes 
equality. Hence (jS]) holds as stated because the essential inhmum is at least as large as the 
inhmum. In other words, elements of A\I\B demand disjoint ‘reasons’ for A and B that hold 
for all outcomes in the probability space, while the right hand side of (jS]) only requires that 
the ‘reasons’ be almost sure. 

In [3], the specialization of (b) to indicator functions and K, = C = 2", and ‘equality’ in 
m, was interpreted to mean that inequality ([3]) holds for general product spaces. However, 
as (b) is an inequality, this interpretation now yields that for all A, B E E the set in S whose 
indicator appears in the right hand side of (jS]) contains AOB and has probability bounded 
above by the product of the probabilities of A and B. This latter interpretation appears in 
Corollary 4 of |2] when § is taken to be Euclidean space. 

The following is a straightforward generalization of Theorem 11.21 stated here only for 
inequality (a). Note that in the inequality below, as m increases the pairwise constraints 
ai n ttj = 0 become more restrictive, and the inequality less sharp. 

Theorem 1.3. Let X. E S be a random vector with independent coordinates. Then for given 
finite or countable collections of non-negative functions {fi,a}a<£Ai depending on {i^i,a}aGA ? 
i = 1,..., m, 


E < sup 

X--xAn 

afena/=0, k^l 


n 




< 


2=1 


m ^ 

TT E i sup fi^, 
UeA 


fxi 


2 = 1 


Next we describe an inequality of Kahn, Saks, and Smyth [6] , which may be considered 
dual to the BKR inequality ([2]), and then provide a function version. We use a notation 
compatible with ([3]). With ‘disjointly’ dehned in ([T]), denote 


A<)B = {(x,y) : X G H,y G H disjointly}. 


Note that 

lAnB(x) = lA0ij(x,x). 

The following, which we call the KSS inequality, is dual to Theorem 11.11 and is given in 

[Si- 

Theorem 1.4. If P denotes the uniform measure over {0,1}”, then for any {A,B) C 
{0 ,l}”x{0 ,l}”, 

{P X P){A0B) < P{AnB). (6) 
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Our functional extension of the KSS inequality is as follows. 

Theorem 1.5. Let X = (Xi,... ,X„) E S be a random vector, P any probability measure 
on (S', S) such that Xi,... are independent, and Y an independent copy ofX.. 

1. Under frameworkUi 


sup UX)gp{Y)} <E\snpUX)gy{X) 


an/3=0 


I 


2. Under framework\^ 


E 


max 

KnL=iti 

K€K,L&C 






<E{f{X)g{X)}. 


ip') 


The 0 operation in Theorem II.41 on {0,1}" x {0,1}"^ was dehned implicitly in |6], and the 
inequality was extended there to product measure on discrete hnite product spaces. With 
/(x) and gipU) the indicator functions of sets A and B respectively, we have the inequality 

lA0i?(x,y)< max /^(x)^^(y), (7) 

where f ,.,g^ are given in (jl]). Therefore, inequality (b') of Theorem 11.51 specialized to the 
case where K, = C = 2^ and / and g are indicators says that the original KSS inequality ()6]) 
for events in discrete hnite product spaces extends to vectors having independent coordinates 
taking values in any measure space in the sense that A()B is a subset of the set whose 
indicator is the fnnction appearing on the right hand side of (JTj), and has probability bonnded 
by P{AnB), by (b’)- 

We next discuss further formulations of Theorems 11.21 and 11.51 which are of independent 
interest, and will be used in the proof. Under Framework (H for any snbsets K and L of n, 
dehne 

/a'(x) = sup /„(x) and ^l(x) = snp gyi^). (8) 

aiKa^K 

For any given fnnctions X(x) and F(x) dehned on S and taking valnes in 2”, nnder Frame¬ 
work (H extend ([8]) to 

/x(x)(x) = sup /a(x) and ^l(x)(x) = sup gpi^), (9) 

a:KaCK{x) li-.Lp(ZL{-x) 

and nnder Framework [21 extend (|1]) to 


%x)W=ess inf ^^(y) 

yG[x]x(x) yG[x]L(^) 


( 10 ) 


Proposition 13.II shows that parts (c) and (d) of Proposition [TTT] are reformulations of (a) of 
Theorem 11.21 and likewise (e) a reformnlation of (b). 
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Proposition 1.1. Let the hypotheses of Theorem \1.2\ hold. In Framework{J\ (c) and (d) 
below obtain. 


E 


max fK{'X.)gL{X) 

KnL=iti 

K€K,LeC 


< 


E sup fa 

I “ 




E 


\ SUP^/3(X) 

I h 


{c) 


i?{/x(x)(X)^L(x)(X)} <e|sup/„(X)| i?|sjr/;5(X)|, {d) 

holding for any given K{yTj G /C and L(x.) G C such that /t'(x) fl L^x.) = 0. 

In Framework\^ we have 

®{£a-(X)(X)^(X)(X)} < B{/(X)}B{9(X)}, (e) 

holding for any given K{x) G K. and L{x) G C such that K{x) fl L{x) = 0. 

As in Theorem 11.21 the special cases of (c) and (d) where /C and C both equal 2" implies 
the inequality in general. Similarly, the special case of inequality (e) with K = C = 2^ and 
L(x) = K'^{x), where denotes the complement of K, yields the inequality in general, that 
is, (e) is equivalent to the statement that for any given K{x.), 

®{£x(x)(X)&.(x)(^)} - E{f{X)]E{g(,X)]. 

Parallel to the claims of Proposition 11.11 parts (c') and (d') below are reformulations of 
(a') of Theorem 11.51 and (e') a reformulation of (b') . For given / and a function A'(x,y) 
taking values in 2", dehne fK(x,Y) and y) replacing K{x) by iF(x, y) in ([2]) and (ITUD 
respectively. 

Proposition 1.2. Let the hypotheses of Theorem \1.5\ hold. In Framework{l\ (o') and (d') 
below obtain. 

b( max /x(X)9i(Y)|<Bi sup UX)gg(X) 

i?|/x(X,Y)(X)^L(X,Y)(Y)| <e| sup fa{X)gg{X)\, {d') 

[^on/3=0 J 

holding for any given K{x,y) G /C and L(x, y) G C replacing K{x) and T(x) in respec¬ 
tively, and satisfying iP(x, y) n L(x, y) = 0. 

In Framework\^ we have 

^5{Zx,x,v,(X)9a,x,v,(Y)} < £{/(X)9(X)}, (e') 

for given K{x,y) G /C and L(x, y) G C replacing K(x) and T(x) in l[Td\) . respectively. 
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2 Applications 

Example 2.1. Order Statistics Type Inequalities Let X = {Xi,..., Xn) be a vector of 
independent non-negative random variables with associated order statistics < • • • < X[i]. 
Let A = B be the collection of all the singletons a E n and fa{'^) = ga(x) = Then 

max/«(X) = X[i], max fa{X)gg{X) = X[i]X[2], 
d an/3=0 

and ineguality (a) of Theorem M.B provides the middle ineguality in the string 


< EX[i]X[ 2] < (EX[i])2 < EXf,y 

The leftmost ineguality is true since order statistics are always positively correlated (moreover 
they are associated as defined by Esary et al and even MTP 2 , see Karlin and Rinott 
the rightmost ineguality follows from Jensen. 

Theorem M.^ allows a large variety of extensions of this basic order statistics ineguality. 
For example, taking A and B to be all k and I subsets of n respectively, then with 

/a(x) = and gy{x.) = (11) 

jea 

we derive 



Dropping the non-negativity assumption on Xi,..., Xn, we have for all t > 0, 

with Yi’s being independent copies of the Xi’s. Likewise, for all t > 0, 

Moment generating function and Laplace orders are discussed in Shaked and Shanthikumar 

a- 

Returning to non-negative variables, a variation of / fTTl) follows by replacing products with 
sums, that is, 

/«(x) = and gy{^) = ^xj, (12) 

jea j£0 

which for, k = I = 2 say, yields 

^ {■ k 2 ^ + ^[j])(^[k] + ^[i]) < [-^(^[ 1 ] + ^[ 2 ])]^- 

Though the maximizing indices on the left hand side will be {1, 2, 3,4} as indicated, the choice 
is not fixed and depends on the X’s; note, for example, that (X[i] +X[ 2 ])(X[ 3 ] +X[ 4 ]) is never 
maximal apart from degenerate cases. 
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Definition m and / fl^) are special cases where f and g are increasing non-negative 
functions of k and I variables and 

/a(x) = fM and gfi{x.) = gi^^); (13) 

when f and g are symmetric, 

E max 


We now give an example which demonstrates that these order statistics type inequalities 
can he considered in higher dimensions. Let Xi,..., X„ be independent vectors in R"*, and 
for a,/9 C n with |a| = |/9| = 3 let fa and gy he given as in where /(xi,X 2 ,X 3 ) = 

5 f(xi,X 2 ,X 3 ) is, say, the area of the triangle formed by the given three vectors. Theorem M.B 
gives that the expected greatest product of the areas of two triangles with distinct vertices is 
bounded above by the square of the expectation of the largest triangular area. 

To explore the dual inequality in these settings, let IX. be a vector of independent variables 
with support contained in [0,1], and Y an independent copy. With A = B the collections of 
all singletons a in n, and fai^) = Xa^gyipL) = 1 — xy, inequality (a') of Theorem \1.5\ gives 


E <! maxX„(l < ^X[i](l - X[„]). 


(14) 


Note that Xaifi — Y^) -^[i](l — ^n])/ hhe right hand side might he larger because of 


the restriction a ^ fi. Removing the restriction a ^ fi reverses (If), that is. 


EX[1](1-X[„]) < EX[1]£;(1-X[„]) = EX[1]E(1-Y[„]) = EX[1](1-Y[„]) = E |maxX„(l -Y^)) , 

where the inequality follows by the negative association of X[i] and 1 — X[„]. 

Following our treatment of applications of Theorem I we can extend [Iff ) as follows: 
with A and B the collection of all k and I subsets of n respectively, and 


/a(x) = JJ Xj and gy{^) = JJ(1 - xfi, 

jea j&0 

we obtain 



We now consider resource allocation problems of the following type. Suppose that two 
projects A and B have to be completed using n available resources represented by the 
components of a vector x. Each resource can be used for at most one project, and an 
allocation is given by a specihcation of disjoint subsets of resources. For any given subsets 
a,/5 C n, let /^(x) and gyipL) count the number of ways that projects A and B can be 
completed using the resources x^ and x^ respectively. The exact dehnitions of the projects 
and the counts are immaterial; in particular larger sets do not necessarily imply more ways to 










carry out a project. For an allocation a, /5, a fl /5 = 0, the total number of ways to carry out 
the two projects together is the product /„(x) 5 f^(x). When the resources are independent 
variables, inequality (a) of Theorem 11.21 bounds the expected maximal number of ways of 
completing A and B together, by the product of the expectations of the maximal number 
of ways of completing each project alone. The bound is simple in the sense that it does 
not require understanding of the relation between the two projects. In particular, it can be 
computed without knowledge of the optimal allocation of resources. 

Example 2.2. With J a list of tasks, consider fulfilling the set of tasks on (not necessarily 
disjoint) lists A O J and B O J, in two distant cities using disjoint sets of workers chosen 
from 1, 2,..., n. Each worker may he sent to one of the cities and assigned a single task or 
a set of tasks which he can perform. A worker may be gualified to fill more than one set of 
tasks. For z G n, let Xj C 2“^ he the collection of possible assignments of tasks for worker i, 
(that is, the sets of tasks worker i is gualified to fulfill); For a, /3 C n and x = (xi,..., x„), 
let /a(x) egual the number of ways the collection of workers a can complete A, and gyi^x.) 
the number of ways the collection j3 can complete B. When the gualifications Xj,i G n are 
independent, Theorem M.2\ bounds the expectation of the maximal number of ways of fulfilling 
the task reguirements in both cities, by the product of the expectations of the maximal numbers 
of ways that the reguirements in each collection can be separately satisfied. 

Example 2.3. Paths on Graphs Consider a graph Q with an arbitrary fixed vertex set V = 
{ui,... ,Vn}, where for each pair of vertices the existence of the edge {vi,Vj} is determined 
independently using a probability rule based on Vi,Vj, perhaps depending only on d{{vi,Vj}) 
for some function d. Let X = where is the indicator that there exists an 

edge between Vi and Vj. For instance, with V C R™ and Zpjp 1 < i,j < n independent 
non-negative variables, we may take for Vi, Vj G V, 

Xpj} = l(d({uj, Uj}) < Zpj}) where d{{vi,Vj}) = \\vi — Vj\\ (Euclidean distance) . 

Note that since the variables do not have to be identically distributed, we can set Zi^i = 0 
and avoid self loops should we wish to do so. 

Let a path in the graph Q from u to w be any ordered tuple of vertices ,..., Vi^ with 
Uii = u,Vi^ = w and = 1 for k = 1,... ,p - 1, and having all edges 

distinct. For u,v and w in V and a, (3 ■ I < i, j < n}, let fa((X.) be the number 

of paths in the graph from u to v which use only edges {vi,Vj} for {i,j} G a; in the same 
manner, let gyfX.) be the number of paths in the graph from v to w which use only edges 
{vi,Vj] for {i,j] G fi. 

The “projects” A and B in this framework are to create paths from u to v using a, and 
from V to w using (3, respectively, which combine together, when a H (3 = (/), to give the 
overall project of creating a path from u to w passing through v. As the product fa((X.)gy(X.) 
for a n 13 = 0 is the number of paths from u to w via v for the given allocation. Theorem 
I j.iH provides a bound on the expected maximal number of such paths over all allocations in 
terms of the product of the expectations of the maximal number of paths from u to w and 
from w to V when the paths are created separately. Though finding the optimal allocation 
may be demanding, the upper bound can be computed simply, for this case in particular by 
monotonicity of fa(x.), gy(x) in a and (3 for fixed x, implying that the maximal number of 
paths created separately is attained when using all available edges, i.e. at a = (3 = n. 
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However, the result and the upper bound hold even in constrained situations where the 
existence of more edges does not lead to more paths, that is, in cases where the functions 
fa,gi 3 are not monotone in a and ft. One such case would be where the existence of a 
particular edge mandates that all paths from u to v use it. More specifically, for some fixed 
{*Oiio} suppose that if {io, jo} ^ « and XjjQjo} = 0 then /q,(x) counts the number of paths 
from u to V. On the other hand if = 1 then faf^) counts the number of paths from u 

to V which use the edge {'I’io ;general such f^ will not be monotone. 

This example easily generalizes to paths with multiple waypoints. We may also consider 
directed graphs where for 1 < i ^ j < n the directed edge {vi,Vj) from Vi to Vj exists when 
Xij = 1, the directed edge {vj,Vi) from Vj to Vi exists when Xij = — 1 and Xij = 0 when 
no edge exists. Returning to the graph example following the statement of Theorem \1.1[ 
when the signed edge indicators are independent, ineguality (a) of Theorem M.B 

provides a bound on the expected maximal number of paths from vertices Vi to and Wi to 
W 2 using disjoint edges. Another possible extension is to consider paths between subsets of 
vertices. 

For application of the dual ineguality, consider for example two directed graphs on the 
same vertex set, determined by egually distributed and independent collections of signed edge 
indicators X and Y, each having independent (but not necessarily identically distributed) 
components. Let a, ft C {{i,j) ■ I < i ^ j < n}, and fafX.) be the number of directed paths 
in the graph from vertices u to v which use only X edges {vi,Vj) with {i,j) G a; in the same 
manner, let gg{Y) be the number of directed paths in the graph from v back to u which use 
only Y edges {vi,Vj) with {i,j) G ft. Consider the expected maximal number of paths, over 
all a and ft with a C ft = If, that go from u to v using the X edges a and return to u from 
V using the Y edges ft. Then Theorem li.5l implies that this expectation is bounded by the 
expected maximal number of paths, over all a and P, to move from u to v using a, and then 
returning to u using P, all with X edges, but where edges used on the forward trip may now 
also be used for the return. 


3 Proofs 


3.1 Proofs of Proposition 11.11 and Theorem 11.21 

We first reduce the problem by proving the following implications between the parts of 
Theorems 11.21 and Proposition 11.11 

Proposition 3.1. (a) ^ (c) ^ (d) (a) and (b) (e). 


Proof: (a) ^ (c): Apply inequality (a) to the hnite collections {/xjxeic, and use 



( 15 ) 


(c) ^ (d): Apply /x(x)(x)^L(x)(x) < maxj^nL=0,KGK;,Le£/ k(x)^l(x). 
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{d) (a): Note that the right hand side of ffT^ equals /x(x)(x)^l(x)(x) for some i^(x) G /C 

and -h(x) E C with K{x) fl -h(x) = 0. 

( 6 ) ^ (e): Apply< maxxnL= 0 ,XG^,Le/:/^(x)^^(x). 

(e) ^ ( 6 ): Use the fact that there exist some disjoint A'(x) E 1C, L{x.) E L such that 

max f ,.Jx.)g (x.) = f Ax.)gJx.). m (16) 

KnL=ii ' ^X(x)^ i^L{yiY > ^ > 

K&K.,L&C 

Let Tc be the sigma algebra generated by a collection of sets C. We say IFc is a hnite 
product sigma sub algebra of § when 

C = ^5 Aj G Cjj' , with Ci C §j hnite for alH = 1,..., n. (17) 

It is easy to see that every hnite sigma algebra, IF, contains a subset G, not containing the 
empty set, such that every element of F can be represented uniquely as a disjoint union of 
elements of G. Call G the disjoint generating set of F . 

Our next objective is proving the inequalities of Framework [H to be accomplished by 
proving (d) in Lemma [3Ul We start with a simple extension of inequality (jS]), expressed in 
terms of indicator functions, from hnite spaces to spaces that may not be hnite, but which 
are endowed with a hnite product sigma algebra. 

Lemma 3.1. Let Q be any probability product measure on the finite product sigma algebra 
Fc withC given by (ITT)) . Then, inequality (a) holds when expectations are taken with respect 
to Q, and {fa}a&A){ 9 i 3 }i 3 &B Fc measurable indicator functions. 

Proof: For i = let G* be the disjoint generating set of Fci- By Theorem 11.11 

applied on the space G = YYi=i 

Q{AUB) < Q{A)Q{B). (18) 

Let events A and B be dehned by the indicator functions 

1a(x) = max/„(x), 1 b(x) = max 5 f; 3 (x), (19) 

a p 

and let A^ and Bj^ be the sets indicated by /q,(x) and gyix) respectively. Suppose x E S 
satishes fa{'x.)gy{x) = 1 for disjoint a,/3. Clearly Aa C A, and as fa depends on Ka, we 
have [x]a C Aa ^ A. As a similar statement holds for B, x E AUB, hence, 

max faix)gis{x) < l^ni?(x). ( 20 ) 

an/3=0 

Now (] 20 |) gives the hrst inequality below, (1T8|) the second inequality, and (IT^ the last 
equality in 

Eq ( me,x fa{X.)gyCK)] < QiADB) < Q{A)Q{B) = Uq {max/«(X)| Uq (max^^(X)l .■ 

[Q!n/3=0 J y. a ) y 0 j 

We say a collection of functions is FP if it generates a hnite product sigma algebra 
contained in S; note that a hnite union of FP collections is FP. 
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Lemma 3.2. Inequality (a) is true for P any probability product measure on {S,S), and 
{fa}aeAT { 90 }/ 3 eB! any finite collections of FP indicator functions. 

Proof: Let T-i be the sigma algebra generated by {fa}a&Ai{g0}0&Bi and Q := P|^, the 
restriction of P to the hnite prodnct sigma algebra P. For h an P measnrable indicator 
fnnction, that is, for /i(x) = 1 a( x) for some A E P, we have 

EqH = Q{A) = P{A) = Eph. (21) 

Since the prodnct of P measnrable indicators is an P measnrable indicator, and the same 
is trne for the maximnm, we have by Lemma [3.11 and (l2Ti) . 

Ep \ max UX)gyiX)] = Eq \ max UX)gyiX)] 

[on ^=0 J [00^=0 J 


< Eq\ 

max 
L a 

fa^ 

(X)] 


|maxc/^(X)J 

= 

max 
L a 

fA 

(X)] 

\Efi 

|max^;3(X)| 


Let V denote the collection of all prodnct sets of the form C = {nr= Si, Si G Ci} where 
Ci C are hnite for alH = 1,..., n. Then 

§ = Pj where = [J (22) 

cev 

Lemma 13.31 generalizes the ineqnality from FP indicator fnnctions to § measnrable indi¬ 
cator fnnctions. 

Lemma 3.3. Inequality (a) is true for any probability product measure P and finite collec¬ 
tions {fa}aeA o,nd {gy}i 3 ^s of S measurable indicator functions. 

Proof: For P, S snbsets of AU B satisfying P S = fb and PU S = AU B, we pro¬ 
ceed by indnction on the cardinality of the set S in the statement I(P,S): ineqnality 
(a) is trne when {fa}aennA, { 90 } 0 ennB are hnite FP collections of indicator fnnctions, and 
{fa}aesnA, {90}0£SnB are any hnite collections of S measnrable indicators. Lemmashows 
that I{AU B,^) is trne, and the conclnsion of the present lemma is I{^,AUB). Assnme for 
some snch P,S with S A U B, that I{fP,S) is trne. For 7 G 7 ?. with, say 7 G .A, let Ai 
be the collection of all sets APS snch that (a) holds for f.y = 1a, and when {fa}aennA\{'r} 
and { 90 } 0 ennB are hnite FP indicators, and {fa}aesnA,{ 90 } 0 esnB are any collection of § 
measnrable indicators. The singleton collection f.y is FP for any A E J given by ([22]). 
Therefore, for any Ae J, the nnion f-^,{fa]a&'RnA\{'y},{ 90 ] 0 enoB is FP. By the indnction 
hypothesis, J P M.. Since A1 is a monotone class and J is an algebra which generates S, 
the monotone class theorem implies § C Al. This completes the indnction. ■ 

We now relax the reqnirement that the fnnctions be indicators. 

Lemma 3.4. Inequality (d) is true for any product measure P and finite collections of §> 
measurable functions {fofiaeA ond {gy}y^B which assume finitely many non-negative values. 
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Proof: We prove (d) by induction on m and /, the number of values taken on by the 
collections {fa}aeA^{ 9 i 3 }i 3 &B^ respectively. By Lemma 13731 inequality (a) is true for hnite 
collections of measurable indicators, and hence by Proposition ll.il so is inequality (d). Now 
the base case m = 2, I = 2 follows readily by extending from indicators to two valued 
functions by linear transformation. 

Assume the result is true for some m and / at least 2, and consider a collection {fa}a€A 
assuming the values 0 < ai < • • • < a^+i; a similar argument applies to induct on 1. For 
some k, 2 < k < m, dehne 

Aa,k = {x : /„(x) = ttk}, 

and for Uk-i < a < ak+i, let 

= /a(x) + (a - afc)lA„,,(x), 

the function /„ with the value of replaced by a. We shall prove that for all a G [ak-i, Ofc+i] 
inequality (d) holds with {h^}a&A replacing {fa}aeA- By the induction hypothesis we know 
it holds at the endpoints, that is, for a G {a^-i, a^+i}, since then the collection {h'^}aGA 
takes on m values; clearly, the case a = ak suffices to prove the lemma. 

Given r(x), a function with values in 2"^, with some abuse of notation denote 


/r(x)(x) = sup /„(x). 
Q;:aGr(x) 


(23) 


Note that /i<'(x)(x) in dH]) corresponds to r(x) = {a : C i^dx)}, and similarly for ^i(x)(x); 

for measurability issues see Section O For any function F(x) with values in 2"^, we have for 
all Q, G [dfc—i, , 

Gr := {x : h“r(x)(x) = a, /r(x)(x) ^ {afe_i,afc+i}} = {x ; /r(x)(x) = ak}, 

showing that Gr does not depend on a. 

Let D = Gr for F(x) = A, and note that 

sup h“(x) = air, + sup /«(x)lr,c. 

a a 

Then the right hand side of (d), with {/i“}aGyi replacing {fa}a(£A, equals aS + A, where 
5 = P{D) / sup 5 ^/ 3 (x)dP(x) and A = / sup/a(x)(iP(x) / sup 5 ^/ 3 (x)dP(x) 

J 0 J o J P 

do not depend on a. Now, let E = Cy for F(x) = {a : = A'(x)} and note that 

h“r'(x)(x) = air; + /r'(x)(x)lr;c. Similarly, the left hand side of (d), with {h'}^}a£A replacing 
{fa}aeAy equals aO + rj, where 

0= ^L(x)(x)dP(x) and r]= ^(x)(x)^r(x)(x)dP(x) 

Je Je<= 

do not depend on a. When a G {a^.i, 0 ^+ 1 } the collection ha,a ^ A takes on m values, so 
by the induction hypotheses (d) holds with {h“}Q,g _4 replacing {fa}aeA and we obtain 


a6* + r; < aJ + A, for a G {afc_i, ak+i}- 


(24) 


By taking a convex combination, we see that inequality 
in particular for Uk, completing the induction. 


holds for all a G [a^-i, a^+i], so 
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Lemma 3.5. Inequality (d) is true for any probability product measure P and finite collec¬ 
tions of non-negative S measurable functions {fa}aeA { 5 '/ 3 }/ 3 €B- 


Proof: Lemma [3.41 shows that the result is true for simple functions. By approximating the 
functions fa^gp below by simple functions, fa,k t fa^Qy^k t gy k f oo, and applying the 
monotone convergence theorem, we have the result for arbitrary non-negative functions. ■ 

Lemma 3.6. Inequality (d) is true for countable collections of non-negative S measurable 
functions {fa}aeA and {gy}yeB- 

Proof: For K, L G 2", let 


V2ir(x) 




and 0i,(x) = 


recalling definition (|8]). Noting 

/k(x)(x) = sup /a(x) = sup sup /«(x)= SUp /i^- (x) = (,Px(x) (x), 

a:KaCK(yi) KCK(yi) a:KaCK KCK{yi) 


and 


sup = sup /i^(x) = sup /o(x), 

K K a 

and similarly for {gy}y(zj 3 , the result follows immediately upon applying Lemma [33] to the 
finite collections {pk}k& 2 ^ and {(f>L]L& 2 ^- ■ 

By Proposition 13.11 at this point we have completed proving all inequalities pertaining to 
Framework [T] The next proposition connects the two frameworks and completes the proof 
of Theorem 11.21 and again applying Proposition 13.11 that of Proposition II.11 

Proposition 3.2. Inequality (a) holds in FrameworkU\for all collections {fQ}a&Ai{gy}y&B 
of given functions, if and only if inequality (b) holds in Framework\^for all given functions 
f and g and collections fC and C. 

Proof: (a) (b). For L C n let Pl(x) denote the marginal of P in the coordinates indexed 

by L. Let functions /, g and collections fC and C of subsets of 2" be given. 

By Fubini’s theorem, for any iL C 2", 


^ /W) = j ^ = j j ^ /(x))dPic.(x)dPff(x) 

= j < /(x))dPx(x) = j IdPi^(x) = 1, 

where the fourth equality holds by definition of the essential inhmum. As /C is finite, 

^ /W) = 1’ implying E |max/^(x| < P{/(X)} , 

with a similar inequality holding for g. Now we see that (b) holds by applying (a) to the 
collections {/^^(x)}kgac and {g^{x)}Lec as in (g]). 
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(6) ^ (a): Given collections of fnnctions fa, 90 depending on KajLj^, define 

/(x) = snp/„(x) and ^(x) = snp ^^^(x). (25) 

a 0 

Now letting f 9 ^ be as in (jl]), we have 

/a(x) = far. (x) < / (x) and likewise ^y 3 (x) <g (x). 

-/\q; -/\q: --t^/3 

Now, for a, 13 disjoint, 

/a(x)^/ 3 (x) < / (x)fl (x) < max / (x)fl(x). 

—Ka KnL=li 

-K'e2",Le2" 

Taking snpremnm on the left hand side over all disjoint a, /5 and then expectation, the resnlt 
now follows by applying ineqnality (b) and (1251) . ■ 

3.2 The Dual Inequality 

As observed in [6], the techniqnes in [10] extend the dnal ineqnality (|6|) from nniform measnre 
on {0,1}” X {0,1}” to any prodnct measnre on a discrete finite prodnct space S. Specifically, 
Lemmas 3.2(iii), 3.4, and 3.5 of [ID] carry over with minimal changes, essentially by replacing 
□ by 0 and fl by x appropriately; for example, the dnal version of Lemma 3.4 wonld begin 
with the identity 


{fxf)-0AOB)= y {(/x/)-i(C, XC 2 )} 

Cl,C2 

where the nnion is over all Ci, C 2 snch that Ci is a maximal cylinder of A, C 2 is a maximal 
cylinder of B, and Ci T C 2 ', see Sections 3 and 2 of (TD] for the formal definitions of maximal 
cylinder, and perpendicularity T, respectively. 

Now the proof of Theorem 11.51 and Proposition 11.21 follow in a nearly identical manner to 
that of Theorem 11.21 and Proposition 11.11 For instance, to prove (d') , consider 

Cv = {(x,y) : /r“r(x,y)(x) = a, /r(x,y)(x) ^ {afc-i,afc+i}} = {(x,y) : /r(x,y)(x) = ak). 

Setting D = Cy for P(x, y) = {a : Ka = A'(x, y)} we can write the left hand side of (d') as 
aO + rj, with 

d= ^L(x,y)(y)dP(x)dP(y) and 9 = 7^(x,y)(x)^i(x,y)(y)dP(x)dP(y), 

Jd J d<^ 

and using P = Gr for P = A, the right hand side becomes ah + A, where 

h= / sup ( 7 ^(x)(iP(x) and A= / sup/a(x) 5 '^(x)(iP(x) 

Jd 0 a ,0 

with 9, ? 7 , 6 and A not depending on a. 
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4 A PQD ordering inequality 

Consider a collection {/q(x)}™^^ of functions which are all increasing or all decreasing in 
each component of x = (xi,..., Xn) G R". Let X = (Xi,..., X„) G R"' be a vector of 
independent random variables, Y = (Yi,... ,Yn) an independent copy of X, and for each 
a = 1, ..., m, let Ha Y n, and 


Zq — (Yi^a, ■ ■ ■ , ^n,a), (26) 

where Zi^a = Y^ if i E Ha, and Zi^a = if i ^ Ha- Now let 

U=(/i(Zi),...,/^(Zj) and V = (/i(X),...,/^(X)). (27) 

Inequalities between vectors below are coordinate-wise. When (|28ll below holds, we say that 
the components of V are more ‘Positively Quadrant Dependent’ than those of U, and write 

U <PQD V. 

Theorem 4.1. For every c = {ci,, Cm) G R™ and Ha C n, a = 1,..., m, 

R(U > c) < P(V > c) and P(U < c) < P(V < c). (28) 

Proof: Since (l28|l holds for U, V if and only if it holds for —U, —V, by replacing the col¬ 
lection by {—/q,(x)}™^]^ when the functions are decreasing, it suffices to consider 

the increasing case. 

For k E {0,... ,n} let H^ = Ha Cl {0,..., A:}, and with Ha replaced by H^, let Z^ and 
be dehned as in (|26|) and respectively. We prove the hrst inequality in (|28|) by induction 
on k in 


P(U^ > c) < P(V > c); (29) 

the second inequality in fl25]) follows in the same manner. Inequality fl2UD is trivially true, 
with equality, when k = f), since then H^ = 0 and Z^ = X for all a G m. Now assume 
inequality (12^ is true for 0 < A: < n and set 

B = {a ■. k + 1 E Ha}. 


Then 

P(U^+^ > c) 

= P(/i(z^+Q>ci,...,/^(z^+Q>cJ 
= P[P(/i(Z^+') > Cl,..., > c^\Xu Yi,l^k + l)] 

= P[P(/„(Z^) > c„, a ^ B\Xi, Yi,l^k + 1)P(/„(Z^+Q > c„, a G P|W, Yi,lj^k + 1)] 
= P[P(/„(Z^) >Ca,a^ B\Xi, Yi,l^k + 1)P(/„(Z^) > c„, a G P|W, Yi,l^k + 1)] 

< P[P(/i(Z^) >ci,...,/™(Z^) >c™)|W,y,/7^A: + l] 

= P(U^ > c) 

< P(V > c). 


where the third equality follows from the independence of and Yfc+i and the fourth from 
the fact that {fa{’Zia)}aeB has the same conditional distribution when either or Yfc+i 
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appears as the k + 1®* coordinate of the Z vector; the hrst inequality follows from the fact 
that conditioned on X;, Yi,l ^ k + 1, the functions are all increasing in and are 

therefore (conditionally) associated, and the second inequality is the induction hypothesis 
(I29j). In fact, for the hrst inequality above it suffices to see that the product of the two 
probabilities conditioned on X;, h), / 7^ A; + 1 is the product of (conditional) expectations of 
two increasing functions of which is smaller than the (conditional) expectation of the 

product. ■ 

Taking c = (c,..., c) we immediately have 

Corollary 4.1. For all c G R, 

P(max/Q,(ZQ,) < c) < P(max/„(X) < c) or equivalently max/„(X)< 52 -max/„(Z q). 

a a a a 

Application 1. Consider the framework of Theorem 11.21 with /„(x), 5 f^(x), a G A,/I G P 
all increasing or all decreasing functions which depend on coordinates Aq,,L^. Dehne P to 
be a collection of functions 


V = iUX) + g^iX) : n = 0}, 

and for Y = (Yi,..., Y„) as above, set 

V* = {/„(X) + g0{Y) :K^nL0 = 0}. 


By Theorem 14.11 we have 


Applying Corollary 14.11 


V* < 


^PQD 


V. 


max {/„(X) +^^(X)} <ST max {/„(X) + ^/ 3 (Y)}. 
on^=0 on^=0 


Exponentiating the last relation and replacing e^°‘ by /„, using obvious properties of the 
max, we obtain 

max{/„(X) 5 (^(X)} < 5 r max {/„(X) 5 ^/ 3 (Y)}. (30) 

an/ 3=0 00^=0 

and therefore 


P{max fa,{X)gfi{X)} < E{ max /„(X) 5 f^(Y)} < P{max/(X)} P{max 5 ((X)}, 

an/3=0 Q:n^=0 O /3 

for nonnegative monotone functions /„ and gp. Thus the relation fl30l) is stronger than the 
BKR inequality for monotone sets, which was proved in HI]. Alexander [T] presents similar 
functional versions in this context. 

As an example we return to order statistics as in Section 12.11 From fl30l) we derive, for 
example, that 

Y[i]A[2] <ST Y[i]Y[ 2] V y[i]X[2]. 

Generalizing by using the functions (ITT]) , we obtain for any p + q = m, 


If’^i 

i=i 


[i] 


max 
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5 Appendix on Measurability 

In this section we briefly deal with various measurability issues. The measurability of the 
functions defined in ([9]) can be seen from 

/i^(x)(x) = ^/x(x)l(/T(x) = K), 

K 

since the given function i^(x) is assumed measurable. Similarly for (|23|) . 

/r(x)(x) = ^ sup/„(x)l(r(x) = A). 

We next prove that given a non-negative, (S, B) measurable function / : S' —?■ R and any 
/C C n, the function /^(x) defined in (jl]) is (§, B) measurable. Letting 

/^(x) = min(/(x),r) 

and Rl(x) be the marginal of T’(x) on the coordinates xj;,, we have 

\im ( [{r - fr{x)YdPK'={x)\ = ess sup (r -/^(y)) = r - ess inf /^(y). 

P^°° \J J ye[x]K yA^]K 

Tonelli’s theorem (see e.g. ID) now implies that ess infyg[x]^/r(y) is measurable. Letting 
r "I' oo shows that (j4]) is measurable. 

The only complication regarding measurability of the pair (iL(x),L(x)) in (fT6|) is that 
the maximum may not be uniquely attained, since otherwise we would simply have 

{x : iL(x) = K, L(x) = L} = Q {x ; /^(x)^^(x) > /^-,(x)^^,(x)}, 

K'nL'=(D 

a finite intersection of measurable sets, so measurable. To handle the problem of non¬ 
uniqueness, let -< be an arbitrary total order on the finite collection of subsets of n x n, so 
that when the max is not unique we can choose (iL(x),L(x)) to be the first disjoint pair 
that attains the maximum. Then {x : iL(x) = K, L(x) = L} = F HG where 

F = Pi Lk ^^)Il > Lk' } 

(K\L')^(K,L) 

i^'nL'=0 

and 

G= n 

(K' ,L')y(K,L) 

K'rL'=% 

and again measurability follows. Similar remarks apply to the maximizing iL(x),L(x) in 
(IT^ for the implication (d) ^ (a). 

Finally, we note inequality (a), and therefore also (b), holds on the completion of (S',S) 
of (S', S) with respect to P. Proposition 2.12 of [1] shows that for every § measuable function 
/ there exists an § measurable function / such that f = f with (completed) measure one. 
Hence, replacing all S measurable functions in (a) by their S measurable counterparts and 
applying (a) over the space (S', S) shows (a) holds on (S', S). 

Acknowledgment: The authors would like to thank Richard Arratia for pointing out to us 
that ([5]) is an inequality in general, and for other insightful comments. 
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